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METHOD AND APPARATUS FOR DISTRIBUTING COMMANDS 
TO A PLURALITY OF CIRCUIT BLOCKS 

BACKGROUND 

5 The invention relates to data processing, and more particularly, to commands that 

are stored in an on-chip memory. 

With the progress of semiconductor technology, it is possible to integrate complex 
circuit blocks on a single chip, chip set, or board. A single chip can have I/O cells, 
datapath operators, memory elements, and control units. Different circuit blocks perform 

10 different functions (or operations). It is the control structure's job to ensure that the 
proper circuit block performs the proper operation at the proper time. 

One example of a circuit block is an adder. Adders are used for counting, 
filtering, and multiplying. Registers are often used at the input and output of the adder to 
ensure that the inputs and outputs arrive at the same time. Another example of a circuit 

15 block is a multiplier. Multipliers are used in digital signal processing operations for 
correlations, convolutions, filtering, and frequency analysis. 

Other examples of circuit blocks include parity generators, comparators, zero/one 
detectors, boolean operators, arithmetic logic units (ALUs), and shifters. Parity 
generators are used to determine whether the number of ones in an input word is odd or 

20 even. Comparators are used to compare the magnitude of two binary numbers. Zero/One 
detectors are used to determine whether a number has all ones or all zeros. Binary 
counters are used to cycle through a sequence of binary numbers. Shifters are important 
for arithmetic shifting, logical shifting, and rotation functions. 

Memory elements are used to store, among other things, the commands for, the 

25 inputs to, and the outputs from the circuit blocks. Memory elements are usually divided 
into three categories: random access memory, serial access memory* and content access 
memory. Random access memory is usually defined as memory that has an access time 
independent of the physical location of the data. Within the general classification of 
random access memory, there are two subcategories: read only memory (ROM) and 
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read/write memory. The term RAM is usually used to refer to read/write memory. 
RAMs are used to store the outputs of the circuit blocks. 

The phrase "data processing" refers to moving data between the circuit blocks to 
achieve a particular function. This movement of data is coordinated by a control structure 
which issues new commands at the start of each clock cycle. Some systems use a SISD 
(single instruction stream single data stream) control unit, wherein instructions are 
processed one at time. Because there is only one instruction, the efficiency of a SISD 
system is usually improved by increasing the length of the instruction word or by 
increasing the clock frequency. 

In a superscalar system, the control unit can operate two or more circuit blocks 
during the same clock cycle by executing two or more instructions at the same time (in 
parallel). If there are N circuit blocks, a superscalar system can theoretically operate N 
circuit blocks during the same clock cycle. Because there are multiple instructions, 
superscalar control units are usually more efficient than SISD control units. 

The term "pipelining" refers to retrieving (fetching) instructions from an on-chip 
memory and decoding these instructions to organize them in time to operate various 
circuit blocks. The control unit can store decoded instructions in an instruction path as 
pipeline stages. The instruction path can be divided into different stages, such as, 
instruction fetch, instruction decode, register read, execute, and/or write. The control unit 
can perform comparisons to permit pass-around and ensure that operations occur in the 
proper sequence. 

Most systems use what is commonly referred to as a very long instruction word 
(VLIW) controller. In a VLIW controller, the control program is stored in an on-chip 
memory as individual words where each word corresponds to a particular clock cycle. 
Each VLIW has at least one bit field for each circuit block, even circuit blocks that are 
inactive relative to the current VLIW. One problem with VLIW controllers is that as the * 
number of circuit blocks increases, so does the length and size of the control program. In 
narticularlv comolex svstems. there is not enoueh chin soace to store the control nroeram. 
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It is not possible to manage each of the circuit blocks without devoting disproportionately 
large portions of the chip to store the instructions. 

In most systems, only a small subset of the circuit blocks are active during each 
clock cycle. As a result, only a small part of each VLIW is actually used. In other words, 
5 the control word is longer than necessary and valuable chip area is wasted. In some 

applications, it is no longer possible to fit the command memory on a single chip. There 
is a need for a control structure and a command memory that can manage multiple circuit 
blocks and issue commands in parallel without wasting valuable chip area. 

10 SUMMARY 

These and other drawbacks, problems, and limitations of conventional control 
units and command memories are overcome by consolidating the commands to reduce the 
amount of information stored in a command memory. The control unit interprets the 
commands and restores the order that was removed by the consolidation. 

15 According to one aspect of the invention, commands are stored contiguously in 

memory words in a command memory. Each command has a label field and an action 
field. A control unit receives a group of commands that are referred to collectively as a 
memory word. The control unit decodes the commands and arranges the action fields as 
commands in a control word based on information in the label field. The control unit 

20 stores the control words in a register and distributes the control words from the register to 
a plurality of circuit blocks. 

When the commands are compressed in the command memory, commands that 
are not performed in parallel can be stored in the same memory word. Commands that 
are performed in parallel can be stored in different memory words. The order of the 

25 commands in the control word is determined by information in the label field, such as 
whether the command is-performed in parallel with a preceding command. 

According to another aspect of the invention, groups of commands that are 
decoded to form time-sensitive control words or control words that are the target of jump 
commands are aligned with a memory word boundary. If a group of commands starts in 
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one memory word and ends in the next, the control unit has to read in both memory 
words. If the group of commands is the target of a jump command, two clock cycles are 
required before the group of commands is fully available. In time critical applications, it 
may be undesirable to wait two clock cycles for the group to become available. If, 
5 however, the group of commands is aligned with a memory word boundary, the control 
unit has to read in only one memory word and the control word is available sooner. In a 
particular embodiment, a special code or an illegal command is inserted in the previous 
memory word so that the group of commands begins at the beginning of the next memory 
word. 

10 According to another aspect of the invention, commands are positioned within a 

group of commands so as to further reduce the size of the command memory. For 
example, a conditional command is a command that requires a particular circuit block to 
evaluate a condition. Other commands are executed if the condition is true. In a 
exemplary embodiment of the invention, if a conditional command is the first command 

15 in a group of commands, the condition applies to all the commands in the group. If the 
conditional command is not the first command in the group, the condition only applies to 
the immediately preceding command. In other words, the positioning of the commands 
in the group imparts additional information. 

An advantage of the invention is that the command memory does not waste 

20 valuable chip area to store commands for inactive circuit blocks. The size of the 
command memory is minimized so as to use the least amount of chip area. 

Another advantage of the invention is that the same controller and the same set of 
commands can be used regardless of the amount of parallelism (the number of 
independent paths or pipelines) in a particular system. 

25 Another important advantage of the invention is that the same control fields can 

be used with slightly different chips that reuse many of the same circuit blocks. The .... 
reuse of circuit blocks is important to decreasing the time and cost associated with 
producing new systems. 
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Another important advantage of the invention is that in time-critical applications, 
groups of commands can be aligned with memory word boundaries so that they can be 
read during a single clock cycle. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing, and other objects, features, and advantages of the invention will be 
more readily understood upon reading the following detailed description in conjunction 
with the drawings in which: 

FIG. 1 is a block diagram of a system having a control unit; 
10 FIG. 2 illustrates schematically the storing of commands in a command memory 

as memory words; 

FIG. 3 illustrates schematically the sequencing of commands by a control unit; 
FIG. 4 illustrates schematically the sequencing of commands by a control unit 
when a command indicates that a group of commands forming the next control word 
15 starts at a different position in memory and the group of commands is not aligned with a 
memory word boundary; : ; 

FIG. 5 illustrates schematically the storing of a group of commands forming a 
control word in a command memory wherein the group of commands is selectively 
aligned with a memory word boundary; and, 
20 FIG. 6 illustrates schematically the sequencing of commands by a control unit 

when a command indicates that a group of commands forming the next control word 
starts at a different position in memory and the group of commands is selectively aligned 
with a memory word boundary. 

25 DETAILED DESCRIPTION 

In the following description, specific details are-set forth m order to provide a 
thorough understanding of the invention. However, it will be apparent to those skilled in 
the art that the invention can be practiced in other embodiments that depart from these 
specific details. In other instances, detailed descriptions of well-known methods, and 
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circuits are omitted so as not to obscure the description of the invention with unnecessary 
detail. 

FIG. 1 is a block diagram of a system having a control unit 20. The control unit 
20 has a decoder 22 and a sequencer 24. The command memory 10 is an on-chip 
5 memory that is used to store commands as bit fields. Some commands are control- 
transfer commands, for example, jump or call commands, and some commands are ALU 
class commands, for example, arithmetic or logic operations. Each command has at least 
two parts: a label field and an action field. 

The control unit 20 transmits an address 41 to the command memory 10 to fetch 

10 commands 42. The control unit 20 receives from the command memory 10 a set of 
commands that are referred to collectively as a memory word. The control unit 20 
decodes the memory word using a decoder 22. A sequencer 24 arranges the action fields 
as commands in a control word based on information in the label fields. The sequencer 
24 stores each control word in a pipeline register (not shown). The order of the 

15 commands depends on the commands 42, conditions 61 from the rest of the chip 60, and 
conditions from the circuit blocks 51-59. 

A control word is a set of commands that are executed in parallel (or during a 
single clock cycle). The commands that are decoded to form the control words are stored 
contiguously in the command memory 10 as part of the memory words. The command 

20 memory 10 is reduced in size by removing unnecessary bit fields and by consolidating 
commands that may or may not be performed in parallel in the same memory word. A 
group of commands that are decoded to form a control word can begin in one memory 
word and end in the next memory word. Because the commands are compressed in the 
command memory 10, a command's identity is no longer provided by its bit position 

25 within the memory word. Instead, the sequencer 24 uses information in the label field of 
each command to organize the decoded commands in time and to restore the order that 
was removed by the consolidation. 

When the control unit 20 receives a set of commands from the command memnrv 

— . . . _ _ y 

10, the control unit 20 can use a label field to identify the length of a corresponding 
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action field. If, for example, each label field indicates whether the command is 
performed in parallel with the preceding command, the control unit 20 can use this 
information to identify the beginning and the end of a group of commands that are 
decoded to form a control word. The control unit 20 can use the label field, the action 

5 field, or the position of the command in the group of commands, to determine the circuit 
block that is to perform the corresponding action field. If a group of commands does not 
have a command directed to a particular circuit block then the control unit 20 knows that 
the circuit block is temporarily inactive. If a circuit block is inactive, it remains inactive 
until there is a control word with a command directed to it. 

10 The control unit 20 issues a control word on control bus 43 for each clock cycle 

from clock 30. The control unit 20 pre-fetches commands 42 from memory 10 at a rate 
sufficient to have a full control word available for each clock cycle from clock 30. The 
control word is delivered to circuit blocks 51, 52, 53, and 59. The control word can 
instruct circuit block 51,52, 53, or 59 to send or retrieve data on system data bus 44. The 

15 control word can instruct circuit block 51, 52, 53, or 59 to process or manipulate data 
received on data bus 44. The control unit 20 can have multiple sequencers and/or 
multiple pipeline registers. The system can have multiple control buses and/or multiple 
data buses. 

The control unit 20 issues control words with commands to the active circuit 
20 blocks. If a group of commands forming a control word does not have a command 

directed to a particular circuit block, the control unit 20 knows that this circuit block is 
temporarily inactive. If a circuit block is inactive, the circuit block remains inactive until 
there is a control word with a command directed to it. If, for example, control unit 20 
decodes a group of commands forming a control word and the group of commands 
25 includes commands for circuit blocks 51 and 52, but no commands directed to circuit 
blocks 53 and 59, the control unit 20 issues a control word with commands directed to 
circuit blocks 51 and 52. 

FIG. 2 illustrates schematically the storing of commands in a command memory 
as memory words. As shown in FIG. 2, command memory 10 has memory words 11, 12, 
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13, 14, 15, ... , and 19. The commands 71, 72, . . . , and 85 are decoded and arranged to 
form control words. In some applications, the control words have a maximum length that 
is limited by the available hardware resources such as the number of data and control 
buses. The maximum length of the control word is also limited by the number of bits the 
5 control unit reads from the command memory during a single clock cycle. 

In an exemplary embodiment, control words are multiples of a basic length such 
as 8 bits or 16 bits. For example, in FIG. 2, the maximum number of data bits read in 
during a single clock cycle is 64 bits. The memory word length is 64 bits and the 
commands are either 16 or 32 bits long. A group of commands that are decoded to form a 

10 control word can be 16, 32, 48, or 64 bits long, i.e. one, two, three, or four commands 
long. It is possible for a control word to begin in one memory word and end in the next 
memory word. For example, if command 71 forms the first control word, commands 72, 
73, 74, and 75 can form the next control word. 

Memory word 1 1 is formed by the commands 71, 72, 73, and 74. Commands 71, 

15 72, 73, and 74 can form a control word. If commands 71, 72, 73, and 74 are decoded to 
form a control word, then commands 71, 72, 73, and 74 are performed during the same 
clock cycle (in parallel). Each command has at least two parts: a label field and an action 
field. In an exemplary embodiment, the label field is four to seven bits long. The label 
field can identify the circuit block to which the action field is directed. If, for example, 

20 command 71 is directed to circuit block 51, the label field of command 71 identifies 
circuit block 5 1 as the circuit block that is to perform the action. If memory word 1 1 
does not have a command directed to a particular circuit block, the control unit 20 knows 
that this circuit block is temporarily inactive. Some of the commands are directed to the 
control unit 20. If a command is directed to the control unit 20, it is not necessary to 

25 issue the command on the control bus 43. 

The length of the control word-is determined by information present in the-label 
field. The label field can provide information that was conventionally provided by the bit 
position of the command in a longer control word. In addition to identifying the circuit 
block to which the action field is directed, the label field can also indicate whether a 
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command is to be performed in parallel with the immediately preceding command. If 
two commands are performed in parallel, they form part of the same control word. The 
decoding and sequencing of the commands is performed by the control unit 20. If, for 
example, commands 71, 72, 73, 74 form a control word, the label fields of commands 72, 
73, and 74 indicate that commands 72, 73, and 74 are to be performed in parallel with the 
immediately preceding command. If, however, command 74 is not part of the same 
control word, the label field of command 74 would indicate that command 74 is not to be 
performed in parallel with the immediately preceding command 73. 

A control word can have any of several different types of commands. A 
command is usually one of four basic action types. Move commands control the 
movement of data between circuit blocks. Operation commands control the operations 
performed by the circuit blocks. Program flow commands control the sequencing of 
commands. Conditional commands are used to evaluate a condition. A command can 
also be a combination of several different types. 

In an exemplary embodiment of the present invention, the commands are 
positioned in a group of commands that are decoded to form a control word so as to 
impart additional information and further reduce the size of the command memory. For 
example, the action resulting from a conditional command can depend on its position in 
the group of commands. If, for example, the conditional command is the first command 
in the group, the condition applies to all the commands in the group. That is, if the 
condition is true, all the following commands are performed. If the condition is false, 
none of the commands are performed. If, on the other hand, the conditional command is 
not the first command in the group of commands, the condition only applies to the 
immediately preceding command. 

FIG. 3 illustrates schematically the sequencing of commands by a control unit. 
The sequencing of commands in the control unit depends on the commands and 
conditions from the circuit blocks and the rest of the chip. In an exemplary embodiment, 
there are three pipeline phases: the command fetch phase, the command decode phase, 
and the execute/write phase. During the fetch phase, a memory word is read from the 
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command memory and the commands forming the memory word are stored in the 
pipelining register. During the decode phase, the control unit reads each label field and 
positions each corresponding action field in a control word based on information in each 
label field. The action field is positioned in the control word so as to restore the 
positional information that was removed when the commands were compressed in the 
command memory. During the execute/write phase, the control word is delivered to the 
circuit blocks. The circuit blocks perform data moves and other operations. The 
resulting data is stored in a destination register or memory. 

In the example shown in FIG. 3, the control unit fetches memory word 1 1 (shown 
in FIG. 2) during clock cycle 0. As mentioned above, the control unit prefetches memory 
words from the command memory and stores them in the pipeline register at a rate 
sufficient to have a full control word available for each clock cycle. During clock cycle 
1, the control unit fetches memory word 12 (shown in FIG. 2) and memory word 1 1 is 
decoded. During clock cycle 2, the control unit fetches memory word 13 (shown in FIG. 
2), memory word 12 is decoded, and the commands— if a full control word is available—is 
delivered to the circuit blocks. During clock cycle 3, the control unit fetches memory 
word 14 (shown in FIG. 2), decodes memory word 13, and executes the next control 
word. During clock cycle 4, the control unit fetches memory word 15 (shown in FIG. 2), 
decodes memory word 14, and executes the next control word. Note that in some cases a 
control word can start in one memory word and end in the next memory word. In the 
example shown, commands 81, 82, and 83 are decoded to form a control word. Thus, 
even though command 81 is decoded during clock cycle 3, it is not executed until clock 
cycle 5. Commands that are to be performed in parallel are usually not executed until all 
the commands are available. 

Usually the commands are decoded to form control words in the same order that 
they are stored in the memory* 10. However, the memory 10 can have commands that 
indicate that the next control word starts at a different position in memory. For example, 
one of a group of commands that are decoded to form a control word can be the target of 
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a jump command. If a one of the commands is the target of a jump command, the group 
of commands should be aligned with a memory word boundary. 

FIG. 4 illustrates schematically the sequencing of commands by a control unit 
when a command indicates that a group of commands forming the next control word 
starts at a different position in memory and the group of commands is not aligned with a 
memory word boundary. In the example shown, command 81 is the first command in a 
control word formed from commands 81, 82, and 83. In the example shown in FIG. 2, 
command 81 is stored in memory word 13 and commands 82 and 83 are stored in 
memory word 14. 

If, during clock cycle 0, a jump command is executed and command 81 is the 
target of the jump command, the next memory word read by the control unit is memory 
word 13. However, the commands 81, 82, and 83 are not all stored in memory word 13. 
This means that it takes two clock cycles before the control word is fully available. 
During clock cycle 2, the control unit fetches memory word 14. Because it takes two 
clock cycles to read in the commands necessary to from the control word, the control 
word is not ready in clock cycle 3. In time critical applications, a control word should be 
ready for every clock cycle. Thus, groups of commands that are decoded to form time 
sensitive control words should be aligned with a memory word boundary so that the 
group of commands are fetched during a single clock cycle. 

FIG. 5 illustrates schematically the storing a group of commands forming a 
control word in a command memory wherein the group of commands is selectively 
aligned with a memory word boundary. In the example shown, command 81 is the first 
command in a control word formed from commands 81, 82, and 83. In FIG. 2, command 
81 was the last command in memory word 13. In FIG. 5, command 90 is inserted in 
memory word 13. As a result, command 81 is the first command in memory word 14. 

Thereare many different ways to pad a memory word. Command,90 can be any 
special command code that the control unit is programmed to recognize. Another way to 
pad an information word is to use an illegal code. The control unit can recognize 
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command's label field command 90 as an illegal command by looking at just a few bits in 
the command's label field. 

Another way that the control unit can determine that command 90 is an illegal 
command, is if command 90 exceeds either the maximum length for a group of 
5 commands that are decoded to form a control word or the maximum length for a control 
word. If commands 77, 78, 79 and 80 form a control word, then command 90 would 
exceed a maximum length. If a bit in the label field indicates that command 90 is to be 
performed in parallel with commands 77, 78, 79, and 80, the control unit knows that 
command 90 is an illegal command and that the next control word begins in the next 

10 memory word. 

Another way to pad an information word is to use a program flow command. In 
some exemplary embodiments, program flow commands are always the first command in 
a group of commands. If a program flow command is not the first command in the group 
of commands, then the control unit knows that the program flow command is an illegal 
15 command. Groups of commands can also be set-up so that there is only one program 

flow command per memory word. If there is more than one program flow command, the 
control unit knows that the second program flow command is an illegal command. When 
the control unit reads in a special code or an illegal command, the control unit knows that 
the next control word starts at the beginning of the next memory word. 
20 FIG. 6 illustrates schematically the sequencing of commands by a control unit 

when a command indicates that a group of commands forming the next control word 
starts at a different position in memory and the group of commands is selectively aligned 
with a memory word boundary. In the example shown, command 81 is the first 
command in a control word formed from commands 81, 82, and 83. As stored in FIG. 5, 
25 command 81 is the first command in memory word 14. As a result, the group of 
commands 8 1 , 82y and 83. can be read in during a single clock cycle. 

If during clock cycle 0, a jump command is executed and command 81 is the 
target of a jump command, the next memory word read by the control unit is memory 
word 14. During clock cycle 1, the control unit reads memory word 14. Unlike FIG. 4, 
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the group of commands forming the control word is fetched during a single clock cycle. 
During clock cycle 2, the memory word 14 is decoded and during clock cycle 3, the 
control word formed from commands 81, 82, and 83 is executed. Unlike FIG. 4, a 
control word is ready for every clock cycle. Even though control words can start in one 

5 memory word and end in the next memory word, groups of commands that are decoded 
to form time sensitive control words should be aligned with memory word boundaries. 

Because the label field can provide information about the length of the commands 
and the circuit blocks to which the action fields are directed, the command memory and 
the control unit can be reused with other circuit blocks. Similarly, the circuit blocks can 

10 be reused with other command memories and control units. 

The preceding description of the preferred embodiments are provided to enable 
any person skilled in the art to make use the present invention. Various modifications to 
these embodiments will be readily apparent to those skilled in the art, and the principles 
described herein may be applied without departing from the scope and the spirit of the 

15 present invention. Thus, the present invention is not limited to the disclosed 

embodiments, but is to be accorded the widest scope consistent with the claims below. 
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WHAT IS CLAIMED IS: 

1 . An apparatus for distributing commands to a plurality of circuit blocks, the 

apparatus comprising: 

a command memory for storing commands contiguously in memory words, the 
5 commands including a label field and an action field; 

a control unit for receiving the memory words, decoding the memory words, 
arranging the action fields as commands in a control word based on information in the 
label field, and storing the control words in a register; and 

means for distributing the control words from the register to the plurality of circuit 

10 blocks. 

2. An apparatus as described in claim 1, wherein the command memory is 
reduced in size by consolidating commands that may or may not be performed in parallel 
in one of the memory words. 

15 

3. An apparatus as described in claim 2, wherein a group of commands that are 
decoded to form a control word begin in a first memory word and end in a second 
memory word. 

20 4. An apparatus as described in claim 3, wherein each label field indicates 

whether a command is performed in parallel with a preceding command. 

5. An apparatus as described in claim 1, wherein a group of commands that are 
decoded to form a control word is aligned with a memory word boundary so that the 

25 group of commands does not begin in a first memory word and end in a second memory 
word.' 

6. An apparatus as described in claim 5, wherein the group is aligned with the 
memory word boundary by inserting a special code in a preceding memory word. 
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7. An apparatus as described in claim 5, wherein the group is aligned with the 
memory word boundary by inserting an illegal command in a preceding memory word. 

8. An apparatus as described in claim 7, wherein each of the control words has a 
maximum possible length and the sequencer recognizes the illegal command as an illegal 
command because the illegal command would exceed the maximum possible length. 

9. An apparatus as described in claim 1, wherein there are different types of 
commands and one type of command is a jump command and for every jump command 
there is a target command; and, 

if a command in a group of commands that are decoded to form a control word is 
a target command, the group of commands is aligned with a memory word boundary. 

1 0. An apparatus as described in claim 1 , wherein there are different types of 
commands and one type of command directs a circuit block to evaluate a condition and if 
the condition is true, at least one other command stored in the command memory is 
executed, and if the condition is false, the at least one other command is not executed. 

11. An apparatus as described in claim 10, wherein if a conditional command is 
the first command in a group of commands that are decoded to form a control word, the 
condition applies to all the commands in the group of commands. 

12. An apparatus as described in claim 10, wherein if a conditional command is 
not the first command in a group of commands that are decoded to form a control word, 
the condition applies to less than all the commands in the group of commands. 

13. A method for providing commands to circuit blocks, the method comprising 
the steps of: 
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storing a plurality of commands in a command memory as memory words, the 
commands including a label field and an action field; 

retrieving a memory word from the command memory; 
decoding the memory word; 
5 placing an action field from an command in a position in a register based upon 

information in the label field; and, 

providing the command to the circuit blocks from the register. 

14. A method as described in claim 13, further comprising the step of: 

10 reducing the size of the command memory by consolidating commands that may 

or may not be performed in parallel in a memory word. 

15. A method as described in claim 14, further comprising the step of padding a 
memory word so that a group of commands that are to be performed in parallel are stored 

15 in the same memory word. 



BNSDOCID: <WO 996341 7A2 J _> 



WO 99/63417 



PCT/SE99/00847 



1/5 




CO 



BNSDOCiD: <WO. 



^996341 7 A2_L> 



WO 99/63417 



PCT/SE99/00847 



FIG. 2 



WORD 19 












WORD 15 


COMMAND 85 








WORD 14 


COMMAND 82 


COMMAND 83 


COMMAND 84 


WORD 13 


COMMAND 78 


COMMAND 79 


COMMAND 80 


COMMAND 81 


WORD 12 


COMMAND 75 


COMMAND 76 


COMMAND 77 


WORD 11 


COMMAND 71 


COMMAND 72 


COMMAND 73 


COMMAND 74 



FIG. 5 



WORD 19 












WORD 15 


COMMAND 84 


COMMAND 85 






WORD 14 


COMMAND 81 


COMMAND 82 


COMMAND 83 


WORD 13 


COMMAND 78 


COMMAND 79 


COMMAND 80 


COMMAND 90 


WORD 12 


CQMMAJUD 75 


GOMMAND 78, 


COMMAND 77 


WORD 11 


COMMAND 71 


COMMAND 72 


COMMAND 73 


COMMAND 74 



BNSDOCID: <WO 996341 7A2_L> 



WO 99/63417 



PCT/SE99/00847 



FIG. 3 



CYCLE 


COMMAND FETCH 

rnfioc 


COMMAND DECODE 
PHASE 


EXECUTE/WRITE 
PHASE 


0 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 






1 


COMMAND 75 
COMMAND 76 
COMMAND 77 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 




2 


COMMAND 78 
COMMAND 79 
COMMAND 80 
COMMAND 81 


COMMAND 75 
COMMAND 76 
COMMAND 77 


COMMAND 71 
COMMAND 72, 
COMMAND 73 
COMMAND 74 


3 


COMMAND 82 
COMMAND 83 
COMMAND 84 


COMMAND 78 
COMMAND 79 
COMMAND 80 
COMMAND 81 


COMMAND 75 
COMMAND 76 
COMMAND 77 


4 


COMMAND 85 


COMMAND 82 
COMMAND 83 
COMMAND 84 


COMMAND 78 
COMMAND 79 
COMMAND 80 


5 




COMMAND 85 


COMMAND 81 
COMMAND 82 
COMMAND 83 


6 






COMMAND 84 
COMMAND 85- 



BNSDOCID: <WO 996341 7A2_I_> 



WO 99/63417 



PCT/SE99/00847 



4/5 



FIG. 4 



CYCLE 


COMMAND FETCH 
PHASE 


COMMAND DECODE 
PHASE 


EXECUTE /WRITE 

t/IL I/IS 1 L. f WW ft ' 1 L 

PHASE 


0 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 






1 


COMMAND 78 
COMMAND 79 

COMMAND 8/ 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 




2 


COMMAND 82 
COMMAND 83 
l/UMMANU ov 


COMMAND 81 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 


3 


COMMAND 85 


COMMAND 82 
COMMAND 83 

COMMAND 84 




4 




COMMAND 85 


COMMAND 81 
COMMAND 82 
COMMAND 83 


5 






COMMAND 84 
COMMAND 85 


•* 

& 









BNSDOCID: <WO 996341 7A2_I_> 



WO 99/63417 



PCT/SE99/00847 



FIG. 6 



CYCLE 


COMMAND FETCH 
PHASE 


COMMAND DECODE 

DLIA CP 

rnHOL 


EXECUTE/WRITE 

rfi/iOL 


0 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 






1 


COMMAND 81 
COMMAND 82 
COMMAND 83 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 




2 


COMMAND 84 
COMMAND 85 


COMMAND 81 
COMMAND 82 
COMMAND 83 


COMMAND 71 
COMMAND 72 
COMMAND 73 
COMMAND 74 


3 




COMMAND 84 
COMMAND 85 


COMMAND 81 
COMMAND 82 
COMMAND 83 


4 






COMMAND 84 
COMMAND 85 


5 








6 









BNSDOCID: <WO 99634 17A2J_> 



This Page Blank (uspto) 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
G06F 9/38 



A3 



(11) International Publication Number: WO 99/63417 

(43) International Publication Date: 9 December 1999 (09.12.99) 



(21) International Application Number: PCT/SE99/00847 

(22) International Filing Date: 18 May 1999 (18.05.99) 



(30) Priority Data: 

09/086,618 



29 May 1998 (29.05.98) 



US 



(71) Applicant: TELEFON A KTIEB OL AGET LM ERICSSON 

(publ) [SE/SE]; S-126 25 Stockholm (SE). 

(72) Inventors: HAMMAR, Claes; Travarevagen 14, S-187 54 

Taby (SE). JACOBSSON, Magnus; Osterangsvagen 23 
B, S-182 46 Enebyberg (SE). HAKANSSON, Stefan; 
Storstigen 14, S-974 32 Lulea (SE). 

(74) Agent: ERICSSON RADIO SYSTEMS AB; Common Patent 
Dept., S-164 80 Stockholm (SE). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, 
GD t GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, 
KP KR KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZA, ZW, 
ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, UG, 
ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, CY, DE, DK, ES, FI, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, 
SN, TD, TG). 



Published 

With international search report. 

(88) Date of publication of the international search report: 

16 March 2000 (16.03.00) 



(54) Title: METHOD AND APPARATUS FOR DISTRIBUTING COMMANDS TO A PLURALITY OF PROCESSING UNITS 

(57) Abstract _ V 

A command memory stores commands in memory words. Each command has a label field and an action field. The commands are 
consolidated to reduce the amount of information stored in the command memory. A control unit interprets the commands and reswres 
the order that was removed by the consolidation. The control unit arranges the action fields as commands in a control word based on 
information in the label field. When the commands are compressed in the command memory, commands that are not Performed .n parallel 
can be stored in the same memory word. Commands that are performed in parallel can be stored in different memory words. The order of 
the commands in the control word is determined by information in the label field, such as whether the command is performed .n parallel 
with a preceding command. Groups of commands that are decoded to form time-sensitive control words or control words thai .are the 
target of jump commands are aligned with a memory word boundary so that the commands forming the control word can be read during 
a single clock cycle. The group of commands can be aligned with a memory word boundary by inserting a special code or an illegal 
command in the previous memory word. Commands can be positioned within a group of commands so as to further reduce the size of the 
command memory. If a conditional command is the first command in a group of commands, the condition applies to all the commands 
in the group. If a conditional command is not the first command in a command sequence, the condition applies to only the immediately 
preceding command. 



BNSDOCID: <WO 996341 7A3J_> 




FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway * 


ZW 


Zimbabwe 


CI 


Cdte d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KJR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 










■ /-* 




mi 










Czech Republic 














DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







BNSDOCID: <WO 996341 7A3_I_> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/SE 99/00847 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 G06F9/38 

According to International Patent Classificatio n (IPC) or to both national classification and PC 

B. FIELDS SEARCHED . 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 G06F 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 
Y 

Y 
A 


EP 0 652 510 A (INTERGRAPH CORP) 

10 May 1995 (1995-05-10) 

column 3, line 48 -column 4, line 18 

column 6, line 38 -column 7, line 24 

column 8, line 13 - line 22 

column 11, line 34 -column 12, line 54 

ADAMS R ET AL: "A PARALLEL PIPELINED 
PROCESSOR WITH CONDITIONAL INSTRUCTION 
EXECUTION" 

COMPUTER ARCHITECTURE NEWS, US, ASSOCIATION 
FOR COMPUTING MACHINERY, NEW YORK, 
vol. 19, no. 1, March 1991 (1991-03), page 
135-142 XP000201932 

ISSN: 0163-5964 
page 138 


1,2,4,5, 

13,14 

10,12 

10,12 
11 


Further documents are listed in the continuation of Box C. £>^J See patent family annex. 


Special categories of cited documents: *T" later document published after the international filing date or pnonty 

apeciai caicgonc* ui ««« { j atc ^ not m conflict with the application but cited to understand 
"A" document defining the general state of the art which is not considered ^ p^^^ OT theory underlying the invention 

to be of particular relevance , . 
«E" document but pubfctad on or after the .n^cnal filing da* "X" ^^^^^^^S^ 
"L** document which may throw doubts on priority claixn(s) or which is step when the document is taken alone 

cited to -gtohiich the publication date of another citation or other , 

special reason (as specified) "Y" document of particular relevance; the claimed invention cannot be 
, c . t j * i . ;kW . B „ ^ w considered to involve an inventive step when the document is 
"CT document referring to an oral disclosure, use, exhibition or other combined with one or mo^e other such documents, such combination 

means being obvious to a person skilled in the art 
"P" document published prior to the international filing date but later than 

the priority date claimed & document member of the same patent family 


Date of the actual completion of the international search 

9 November 1999 


]>it^ of jnaihng of the international search report 

2 9. 11. 99 


Name and mailing address of the ISA/ 
Eiraaem Patent Omoe. P.a 581 8 Pattrttaan 2 
NL-2ZBOHVR*** 
Te4. (491-70) 340-2040. Tx. 31 6S1 epo rt. 
Fwc (+31-70) 340-3016 


Authorized officer 

Daskalakls, T 

Telephone No. 



Form PCT/ISA^ia (second sheet) (July 1992) 



BNSDOCID: <WO 996341 7A3J_> 



IO^I 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/SE 99/00847 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



EP 0 455 966 A (IBM) 

13 November 1991 (1991-11-13) 

column 7, line 12 -column 8, line 22 

column 12, line 8 - line 39 

column 13, line 31 -column 14, line 1 

column 14, line 39 -column 17, line 28 

EP 0 825 529 A (IBM) 

25 February 1998 (1998-02-25) 

the whole document 

EP 0 473 420 A (HITACHI LTD) 
4 March 1992 (1992-03-04) 
page 4, line 19 - line 32 

EP 0 724 213 A (IBM) 

31 July 1996 (1996-07-31) 

the whole document 



1,2,4,5, 
13,14 



1-4,9, 
13,14 



1,13 



US 5 701 430 A (JEREMIAH THOMAS LEO 
AL) 23 December 1997 (1997-12-23) 
claim 1 



ET 



EP 0 488 819 A (TOKYO SHIBAURA ELECTRIC 
CO) 3 June 1992 (1992-06-03) 
the whole document 

MAH0N M J ET AL: "HEWLETT-PACKARD 
PRECISION ARCHITECTURE: THE PROCESSOR" 
HEWLETT-PACKARD JOURNAL , US , HEWLETT-PACKARD 
CO. PALO ALTO, 

vol. 37, no. 8, August 1986 (1986-08), 
page 4-22 XP0002 11314 
page 11, right-hand column 



10-12 



10,12 



Form PCT/ISA/210 (continuation of second sheet) (July 1992) 

<WO 99634 1 7A3_I_> 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/SE 99/00847 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17{2)(a) for the following reasons: 



1.1 Claims Nos.: „ 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. Claims Nos.: , . . . . . 

because they relate to parts of the International Application that do not comply with the prescnbed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: * t a , a a/ \ 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



1 . f"~~| As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
L^-l searchable claims. 



2 CZI al1 searcnable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 



of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. |~| No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [ [ The additional search fees were accompanied by the applicant's protest. 

[ )( j No protest accompanied the payment of additional search fees. 



Form PCT/1SA/21 0 (continuation of first sheet (1 )) (July 1998) 

BNSDOCID: <WO 9963417A3_I_> 




INTERNATIONAL SEARCH REPORT |ntemationa , App|ication No pcT/SE 9 9/eo 847 

FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-9, 13-15 

Apparatus for distributing comnands constituting compound 
instructions to parallel functional units. 

2. Claims: 10-12 

Conditional execution of coninands constituting a compound 
instruction. 



BNSDOCID: <WO 996341 7A3_I_> 



INTERNATIONAL SEARCH REPORT 



PCT/SE 99/00847 



Patent document 


KUDSC&uOn 




Patent famllv 

rWnii urn hi y 


PU>8catton 




date 




members) 


dole 


EP 0652510 A 


10-05-1995 


EP 


0950946 A 


20-10-1999 


US 


5560028 A 


24-09-1996 


EP 0455966 A 


13-11-1991 


US 


5214763 A 


25-05-1993 




AT 


131637 T 


15-12-1995 




CA 


2040304 C 


11-04-1995 




CS 


9100933 A 


13-05-1992 




DE 


69115344 D 


25-01-1996 




DE 


69115344 T 


20-06-1996 




HU 


214423 B 


30-03-1998 




JP 


2039125 C 


28-03-1996 




JP 


4230528 A 


19-08-1992 




JP 


7076924 B 


16-08-1995 




JP 


2046578 C 


25-04-1996 




JP 


6083623 A 


25-03-1994 




JP 


7078737 B 


23-08-1995 




PL 


165491 B 


30-12-1994 




US 


5465377 A 


07-11-1995 




US 


5475853 A 


12-12-1995 




US 


5295249 A 


15-03-1994 


EP 0825529 A 


25-02-1998 


US 


5197135 A 


23-03-1993 




US 


5295249 A 


15-03-1994 




CA 


2037708 A t C 


05-11-1991 




CS 


9100935 A 


17-12-1991 




DE 


69122294 D 


31-10-1996 




DE 


69122294 T 


10-04-1997 




EP 


0454984 A 


06-11-1991 




EP 


0481031 A 


22-04-1992 




EP 


0545927 A 


16-06-1993 




JP 


7073036 A 


17-03-1995 




JP 


8007681 B 


29-01-1996 




JP 


6080489 B 


12-10-1994 




JP 


4505823 T 


08-10-1992 




JP 


6079273 B 


05-10-1994 




JP 


4506878 T 


26-11-1992 




PL 


166513 B 


31-05-1995 




WO 


9117496 A 


14-11-1991 




WO 


9117495 A 


14-11-1991 




US 


5448746 A 


05-09-1995 




US 


5500942 A 


19-03-1996 




US 


5504932 A 


02-04-1996 




US 


5303356 A 


12-04-1994 




AT 


146611 T 


15-01-1997 




CA 


2039640 A 


05-11-1991 




CA 


2040637 A,C 


05-11-1991 




CS 


9100936 A 


17-12-1991 




DE 


69123629 D 


30-01-1997 




DE 


69123629 T 


12-06-1997 




EP 


0454985 A 


06-11-1991 




JP 


2500082 B 


29-05-1996 




JP 


4229326 A 


18-08-1992 




PL 


165524 B 


31-01-1995 




US 


5465377 A 


07-11-1995 




US 


5502826 A 


26-03-1994 




US 


5732234 A 


24-03-1998 




EP 


0463299 A 


02-01-1992 




JP 


2046580 C 


25-04-1996 



Fom PCTA8A/210 <p«M tvrtfyamx) <Ji4y 1902) 



page I of 2 



BNSDOCID: <WO 996341 7A3J_> 



INTERNATIONAL SEARCH REPORT 

Irtotnwtton on patent twntty nwnbwi 


Ink joneJi 

PCT/SE < 


Ippfecation No 
99/00847 


Patent document 1 Plantation 
cted k\ search report date 


Patent tamBy 
members) 


Plantation 
date 



EP 0825529 



JP 
JP 
EP 
JP 
JP 
JP 
US 
US 



7082438 B 



0496928 A 
2046578 C 
6083623 A 
7078737 B 
5446850 A 
5701430 A 



06-09-1995 
05-08-1992 
25-04-1996 
25-03-1994 
23-08-1995 
29-08-1995 
23-12-1997 



EP 0473420 A 04-03-1992 US 5163139 A 10-11-1992 

JP 4313121 A 05-11-1992 

EP 0724213 A 31-07-1996 US 5742784 A 21-04-1998 

JP 8249180 A 27-09-1996 



US 


5701430 


A 


23- 


•12-1997 


US 


5446850 A 


29-08-1995 








US 


5295249 A 


15-03-1994 












US 


5303356 A 


12-04-1994 












CA 


2040637 A,C 


05-11-1991 












EP 


0496928 A 


05-08-1992 












EP 


0481031 A 


22-04-1992 












EP 


0545927 A 


16-06-1993 












EP 


0825529 A 


25-02-1998 












JP 


2046578 C 


25-04-1996 












JP 


6083623 A 


25-03-1994 












JP 


7078737 B 


23-08-1995 












JP 


6080489 B 


12-10-1994 












JP 


4505823 T 


08-10-1992 












JP 


6079273 B 


05-10-1994 












JP 


4506878 T 


26-11-1992 












UO 


9117496 A 


14-11-1991 












UO 


9117495 A 


14-11-1991 












US 


5465377 A 


07-11-1995 












US 


5448746 A 


05-09-1995 












US 


5504932 A 


02-04-1994 












CA 


2053941 A 


30-09-1992 


EP 


0488819 


A 


03- 


-06-1992 


JP 


4308946 A 


30-10-1992 








JP 


2835179 B 


14-12-1998 












JP 


4205625 A 


27-07-1992 












DE 


69130757 D 


25-02-1999 












DE 


69130757 T 


29-07-1999 












US 


5617553 A 


01-04-1997 












US 


5446849 A 


29-08-1995 



BNSDOCID: <WO 996341 7A3J_> 



page 2 of 2* 



